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Abstract 

We present a number of positive and negative results for variants of the matroid secretary 
problem. Most notably, we design a constant-factor competitive algorithm for the "random as- 
signment" model where the weights arc assigned randomly to the elements of a matroid, and then 
the elements arrive on-line in an adversarial order (extending a result of Soto [21]). This is under 
the assumption that the matroid is known in advance. If the matroid is unknown in advance, 
we present an 0(log r log n)-approximation, and prove that a better than 0(logn/loglogn) 
approximation is impossible. This resolves an open question posed by Babaioff et al. [3]. 

As a natural special case, we also consider the classical secretary problem where the number 
of candidates n is unknown in advance. If n is chosen by an adversary from {1, . . . , N}, we 
provide a nearly tight answer, by providing an algorithm that chooses the best candidate with 
probability at least l/(_ffjv-i + 1) and prove that a probability better than 1/Hm cannot be 
achieved (where Hn is the A-th harmonic number). 

1 Introduction 

The secretary problem is a classical problem in probability theory, with obscure origins in the 1950's 
and early 60's ([12, 18, 9]; see also [11])- The goal in this problem is to select the best candidate 
out of a sequence revealed one-by-one, where the ranking is uniformly random. A classical solution 
finds the best candidate with probability at least 1/e [11]. Over the years a number of variants 
have been studied, starting with [13] where multiple choices and various measures of success were 
considered for the first time. 

Recent interest in variants of the secretary problem has been motivated by applications in 
on-line mechanism design [15, 19, 3], where items are being sold to agents arriving on-line, and 
there are certain constraints on which agents can be simultaneously satisfied. Equivalently, one can 
consider a setting where we want to hire several candidates under certain constraints. Babaioff, 
Immorlica and Kleinberg [3] formalized the matroid secretary problem and presented constant-factor 
competitive algorithms for several interesting cases. The general problem formulated in [3] is the 
following. 



Matroid secretary problem. Given a matroid Ai = (E,I) with non- negative weights assigned 
to E\ the only information known up-front is the number of elements n := \E\. The elements of E 
arrive in a random order, with their weights revealed as they arrive. When an element arrives, it 
can be selected or rejected. The selected elements must always form an independent set in A4, and 
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a rejected element cannot be considered again. The goal is to maximize the expected weight of the 
selected elements. 

Additional variants of the matroid secretary problem have been proposed and studied, depend- 
ing on how the input ordering is generated, how the weights are assigned and what is known in 
advance. In all variants, elements with their weights arrive in an on-line fashion and an algorithm 
must decide irrevocably whether to accept or reject an element once it has arrived. We attempt 
to bring some order to the multitude of models and we classify the various proposed variants as 
follows. 

Ordering of matroid elements on the input: 

• AO = Adversarial Order: the ordering of elements of the matroid on the input is chosen by 
an adversary. 

• RO = Random Order: the elements of the matroid arrive in a random order. 
Assignment of weights: 

• A A = Adversarial Assignment: weights are assigned to elements of the matroid by an adver- 
sary. 

• RA = Random Assignment: the weights are assigned to elements by a random permutation 
of an adversarial set of weights (independent of the input order, if that is also random). 

Prior information: 

• MK = Matroid Known: the matroid is known beforehand (by means of an independence 
oracle) . 

• MN = Matroid - n known: the matroid is unknown but the cardinality of the ground set is 
known beforehand. 

• MU = Matroid - Unknown: nothing about the matroid is known in advance; only subsets of 
the elements that arrived already can be queried for independence. 

For example, the original variant of the matroid secretary problem [3], where the only informa- 
tion known beforehand is the total number of elements, can be described as RO-AA-MN in this 
classification. We view this as the primary variant of the matroid secretary problem. 

We also consider variants of the classical secretary problem; here, only 1 element should be 
chosen and the goal is to maximize the probability of selecting the best element. 

Classical secretary problems: 

• CK = Classical - Known n: the classical secretary problem where the number of elements in 
known in advance. 

• CN = Classical - known upper bound N: the classical secretary problem where the number 
of elements is chosen adversarially from {1, . . . , A^}, and N is known in advance. 

• CU = Classical - Unknown n: the classical secretary problem where no information on the 
number of elements is known in advance. 

Since the independent sets of the underlying matroid in this model are independent of the particular 
labeling of the ground set (i.e., RO-AA-CK, AO-RA-CK and RA-RO-CK models are equivalent), 
we just use the weight assignment function to characterize different variants of this model. The 
classical variant of the secretary problem which allows a 1/e-approximation would be described 
as RA-CK. The variant where the number of elements n is not known in advance is very natural 
- and has been considered under different stochastic models where n is drawn from a particular 
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distribution [23, 1] - - but the worst-case scenario does not seem to have received attention. We 
denote this model RA-CU, or RA-CN if an upper bound on the number of candidates is given. 
In the model where the input ordering of weights is adversarial (AA-CK), it is easy to see that 
no algorithm achieves probability better than 1/n [5]. We remark that variants of the secretary 
problem with other objective functions have been also proposed, such as discounted profits [2], and 
submodular objective functions [4, 14]. We do not discuss these variants here. 

1.1 Recent related work 

The primary variant of matroid secretary problem (RO-AA-MN model) was introduced in [3]. 
In the following, let n denote the total number of elements and r the rank of the matroid. An 
0(log r)-approximation for the RO-AA-MN model was given in [3]. It was also conjectured that a 
constant-factor approximation should exist for this problem and this question is still open. Very re- 
cently, Chakraborty and Lachish [7] improved [3] by giving an 0(^\og r)-approximation algorithm. 
Constant-factor approximations were given in [3] for some special cases such as partition matroids 
and graphic matroids with a given explicit representation. Further, constant-factor approximations 
were given for transversal matroids [8, 20] and laminar matroids [17]. However, even for graphic 
matroids in the RO-AA-MK model when the graphic matroid is given by an oracle, no constant 
factor is known. 

Babaioff et al. in [3] also posed as an open problem whether there is a constant-factor ap- 
proximation algorithm for the following two models: Assume that a set of n numerical values are 
assigned to the matroid elements using a random one-to-one correspondence but that the elements 
are presented in an adversarial order (AO-RA in our notation). Or, assume that both the assign- 
ment of values and the ordering of the elements in the input are random (RO-RA in our notation) . 
The issue of whether the matroid is known beforehand is left somewhat ambiguous in [3]. 

In a recent work [21], Jose Soto partially answered the second question, by designing a constant- 
factor approximation algorithm in the RO-RA-MK model: An adversary chooses a list of non- 
negative weights, which are then assigned to the elements using a random permutation, which is 
independent of the random order at which the elements are revealed. The matroid is known in 
advance here. 

1.2 Our results 

Matroid secretary. We resolve the question from [3] concerning adversarial order and random 
assignment, by providing a constant-factor approximation algorithm in the AO-RA-MK model, and 
showing that no constant-factor approximation exists in the AO-RA-MN model. More precisely, we 
prove that there is a 40/(1 — l/e)-approximation in the AO-RA-MK model, i.e. in the model where 
weights are assigned to the elements of a matroid randomly, the elements arrive in an adversarial 
order, and the matroid is known in advance. We provide a simple thresholding algorithm, which 
gives a constant-factor approximation for the AO-RA-MK model when the matroid Ai is uniformly 
dense. Then we use the principal sequence of a matroid to design a constant-factor approximation 
for any matroid using the machinery developed by Soto [21]. (Subsequently to our work, Soto [22] 
improved our approximation factor in the AO-RA-MK model to 16/(1 — 1/e).) 

On the other hand, if the matroid is not known in advance (AO-RA-MN model), we prove that 
the problem cannot be approximated better than within f2(logn/loglogn). This holds even in 
the special case of rank 1 matroids; see below. On the positive side, we show an O(logrlogn)- 
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approximation for this model. We achieve this by providing an 0(log r)-approximation thresholding 
algorithm for the AO-AA-MU model (when both the input ordering and the assignment of weights 
to the elements the matroid are adversarial) , when an estimate on the weight of the largest non-loop 
element is given. Here, the novel technique is to employ a dynamic threshold depending on the 
rank of the elements seen so far. 

Classical secretary with unknown n. A very natural question that arises in this context is 
the following. Consider the classical secretary problem, where we want to select 1 candidate out 
of n. The classical solution relies on the fact that n is known in advance. However, what if we do 
not know n in advance, which would be the case in many practical situations? We show that if an 
upper bound N on the possible number of candidates n is given (RA-CN model: i.e., n is chosen 
by an adversary from {1, . . . , -/V}), the best candidate can be found with probability 1/(Hn_\ + 1), 
while there is no algorithm which achieves probability better than 1/Hn (where Hn = ^2iL\ j is 
the iV-th harmonic number). 

In the model where we maximize the expected value of the selected candidate, and n is 
chosen adversarially from {1,...,N}, we prove we cannot achieve approximation better than 
17 (log N/ log log N). On the positive side, even if no upper bound on n is given, the maximum- 
weight element can be found with probability e/log 1+€ n for any fixed e > 0. We remark that 
similar results follow from [16] and [10] where an equivalent problem was considered in the context 
of online auctions. More generally, for the matroid secretary problem where no information at all 
is given in advance (RO-AA-MU), we achieve an log r log 1+e n) approximation for any e > 0. 
See Table 1 for an overview of our results. 



Problem 


New approximation 


New hardness 


RA-CN 


H N -i + 1 


Hn 


RA-CU 


<9(Mog i+e n) 


0(log n) 


AO-RA-MK 


40/(1 - l/e) 




AO-RA-MN 


0(logr logn) 


0, (log n j log log n) 


AO-RA-MU 


0(~ logr log 1+e n) 


0, (log n j log log n) 


RO-AA-MU 


0(Mogrlog i+£ n) 


Q (log n j log log re) 



Table 1: Summary of results 



Organization. In section 2 we provide a 40/(1 — l/e) approximation algorithm for the AO-RA- 
MK model. In section 3 we provide an O(logralogr) approximation algorithm for the AO-RA-MN 
model, and an log r log 1+e n) approximation for the RO-AA-MU model. Finally, in section 4 
we provide a (Hn— i + l)-approximation and -ffjv-hardness for the RA-CN model. 

2 Approximation for adversarial order and random assignment 

In this section, we derive a constant-factor approximation algorithm for the AO-RA-MK model, i.e. 
assuming that the ordering of the elements of the matroid is adversarial but weights are assigned to 
the elements by a random permutation, and the matroid is known in advance. We build on Soto's 
algorithm [21], in particular on his use of the principal sequence of a matroid which effectively 
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reduces the problem to the case of a uniformly-dense matroid while losing only a constant factor 
(1 — 1/e). Interestingly, his reduction only requires the randomness in the assignment of weights 
to the elements but not a random ordering of the matroid on the input. Hence, it is sufficient 

to obtain a constant factor for uniformly dense matroids. Recall that the density of a set in a 

I S I 

matroid At = (E,Z) is the quantity 7(5) = ^U. A matroid is uniformly dense, if 7(5) < j(E) 
for all 5 C E. We present a simple thresholding algorithm which works in the AO-RA-MK model 
(i.e. even for an adversarial ordering of the elements) for any uniformly dense matroid. Combining 
our algorithm with Sotos reduction [21, Lemma 4.4], we obtain a constant-factor approximation 
algorithm for the matroid secretary problem in AO-RA-MK model. 

Throughout this section we use the following notation. Let Ai = (E, I) be a uniformly dense 
matroid of rank r. This also means that Ai contains no loops. Let \E\ = n and let e\, e-ii ■ ■ ■ % e n 
denote the ordering of the elements on the input, which is chosen by an adversary (i.e. we consider 
the worst case). Furthermore, the adversary also chooses W = {w\ > W2 > . ■ ■ > w n }, a set 
of non-negative weights. The weights are assigned to the elements of Ai via a random bijection 
to : E — > W. For a weight assignment u, we denote by w(S) = X^eGS ,a; ( e ) the weight of a set 
5, and by uj(S) = {u(e) : e £ 5} the set of weights assigned to 5. We also let OPT(w) be the 
maximum- weight independent set in Ai. 

2.1 Approximation for uniformly dense matroids 

We show that there is a simple thresholding algorithm which includes each of the topmost [r/4j 
weights (i.e. w±, . . . , w^/^ ) with a constant probability. This will give us a constant factor approx- 
imation algorithm, as u>(OPT(u;)) < Yll=i w ii where w\ > W2 > ■ ■ ■ > w r are the r largest weights 
in W. It is actually important that we compare our algorithm to the quantity ^2l = i"Wi, because 
this is needed in the reduction to the uniformly dense case. 

The main idea is that the randomization of the weight assignment makes it very likely that the 
optimum solution contains many of the top weights in W. Therefore, instead of trying to compute 
the optimal solution with respect to w, we can just focus on catching a constant fraction of the top 
weights in W. Let A = {ei, . . . , e n / 2 } denote the first half of the input and B = {e n y 2 +i, ■ ■ ■ , e n } 
the second half of the input. Note that the partition into A and B is determined by the adversary 
and not random. Our solution is to use the [r / 4J + 1-st topmost weight in the "sampling stage" A 
as a threshold and then include every element in B that is above the threshold and independent of 
the previously selected elements. Details are described in Algorithm 1. 

Theorem 2.1. Let Ai be a uniformly dense matroid of rank r, and ALG(o;) be the set returned by 
Algorithm 1 when the weights are defined by a uniformly random bijection uj : E —} W . Then 

E w MALG(w))]>ij>i 

i=l 

where {w\ > W2 > ... > w r } are the r largest weights in W . 

If r < 12, the algorithm finds and returns the largest weight w\ with probability 1/e (step 
2; the optimal algorithm for the classical secretary problem). Therefore, for r < 12, we have 

e w malgm)] > ^ ELi > A E[=i 
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Algorithm 1 Thresholding algorithm for uniformly dense matroids in AO-RA-MK model 
Input: A uniformly dense matroid M = (E,I) of rank r. 
Output: An independent set ALG C E. 
1: if r < 12 then 

2: run the optimal algorithm for the classical secretary problem, and return the resulting sin- 
gleton. 
3: end if 
4: ALG <- 

5: Observe a half of the input (elements of A) and let w* be the ([^/4J + l) s * largest weight among 
them. 

6: for each element e £ B arriving afterwards do 

7: if co(e) > w* and ALG U {e} is independent then 

8: ALG <- ALG U {e} 

9: end if 
10: end for 
11: return ALG 



For r > 12, we prove that each of the topmost \r/A\ weights will be included in ALG(cj) with 
probability at least 1/8. Hence, we will obtain 

, Lr/4J r 

E w [w(ALG(lu))] > - J2 > 45 J>'- C 1 ) 

i=l i=l 

Let t = 2[r/4\ + 2. Define C"(w) = {ej : uj(e,j) > "UJ^} to be the set of elements of J\A which 
get one of the top t weights. Also let A'(u) = C'(uj) n A and B'(u) = C'(u) n S. Moreover, 
for each 1 < i < t we define C'^u) = {ej : w(ej) > wt Sz oj(ej) ^ Wi}, A'^oj) = C'^oj) n A and 
B'^lo) = C[{oo) n i?, i.e. the same sets with the element of weight Wi removed. 

First, we fix i < [r/4\ and argue that the size of B'^oo) is smaller than A'^u) with probability 
1/2. Then we will use the uniformly dense property of A4 to show that the span of B[{uj) is 
also quite small with probability 1/2 and consequently w% has a good chance of being included in 
ALG(w). 

Claim 2.2. Let M be a uniformly dense matroid of rank r, t = 2[r/4\ +2, 1 < i < \r/4\, and 
B[{ijS) defined as above. Then we have 

P w [l^'HI < Lr/4J] = 1/2. (2) 

Proof. Consider C 4 '(w), the set of elements receiving the top t weights except for u>j. This is a 
uniformly random set of odd size t — 1 = 2|_r/4j + 1. By symmetry, with probability exactly 1/2, 
a majority of these elements are in A, and hence at most \r/A\ of these elements are in B, i.e. 
\Bi( U )\ < Lr/4J. □ 

Now we consider the element receiving weight Wi. We claim that this element will be included 
in ALG(w) with a constant probability. 

Claim 2.3. Let M be a uniformly dense matroid of rank r, and i < [r/4\ . Then 

P w [to-Hwi) e ALG (a;)] > 1/8. 
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Proof. Condition on C 4 '(oj) = S for some particular set S of size t — 1 such that |J3^(u>)| = |Sn.B| < 
[r/4j . This fixes the assignment of the top t weights except for w%. Under this conditioning, weight 
Wi is still assigned uniformly to one of the remaining n — t + 1 elements. 

Since we have |^4-(w)| = \S H A\ > \r/A\ + 1, the threshold w* in this case is one of the top t 
weights and the algorithm will never include any weight outside of the top t. Therefore, we have 
ALG (a;) C B'(ui). The weight Wi is certainly above w* because it is one of the top [r/4j weights. It 
will be added to ALG(w) whenever it appears in B and it is not in the span of previously selected 
elements. Since all the previously included elements must be in B'^lj) = S fl B, it is sufficient to 
avoid being in the span of S fl B. To summarize, we have 



to 



- l (wi) E B\span(SnB) => co~ 1 (wi) E ALG(w). 



What is the probability that this happens? Similar to the proof of [21, Lemma 3.1], since M is 
uniformly dense, we have 

\span(S n B)\ \span(S n B)\ n n.„ „, n 

^ — ~n — to < - => \span(SnB)\ < -\S D B\ < 



\S H B\ rank{span(S Pi B)) r r 4 

using |iSTl B\ < [r/4j. Therefore, there are at least n/4 elements in B \ span(S fl .B). Given that 
the weight wi is assigned uniformly at random among n — t possible elements, we get 

P,, \lo- 1 { Wi ) £B\ span{SC\B) \ C'Au) = S] > > - A . 

n — t 4 

Since this holds for any S such that \S n B\ < [r/4\, and SnB = C|nB = B'^uj), it also holds 
that 

P. eB\span{B[{u)) \ \B[{uj)\ < \r/A\] > I 

Using Claim 2.2, we get V u [w _1 (^) ^ B \ span{B[{uj))] > 1/8. □ 



This finishes the proof of Theorem 2.1. 



2.2 Extension to general matroids 

In this section we describe the final 40/(1 — 1/e) approximation algorithm for AO-RA-MK model for 
general matroids. The algorithm is based on Soto's algorithm [21], by decomposing the underlying 
matroid into a sequence of principal minors and then running Algorithm 1 in parallel on each of 
them separately. 

Algorithm 2 Thresholding algorithm for matroid secretary problem in AO-RA-MK model 
Input: A matroid M = (E,l). 
Output: An independent set ALG C E. 
1: Compute the sequence of principal minors (Mi)f =1 : Initialize k = 0. While (Ji=i E i ¥= E , let 

Ek+i be the densest set in the matroid A4/(J/ =1 £'i, define M.k+1 = Ui=i Ei)\ E k+l> an( i 

increment k. 

2: Run Algorithm 1 in parallel on each Mi to get a solution Jj, and return ALG = Ui=i h- 
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We use Soto's lemma to argue that if the weights are assigned randomly to the elements, and 
we achieve an a-fraction of the sum of the ri topmost weights in each principal minor Mi, then we 
obtain an q/(1 - 1/e) approximation overall. 

Interestingly, it is necessary to know the matroid in advance, in order to discriminate the dense 
parts of the matroid from the sparse parts (by computing the principal minors), and try to handle 
them separately. Otherwise, as we prove later, no algorithm can do better than an O (log nj log log n) 
approximation. 

Corollary 2.4. Algorithm 2 gives a jz§j^~ approximation in the AO-RA-MK model. 

Proof. Similar to the proof of Theorem 2.1, let e\, . . . , e n be the sequence of elements of M designed 
by the adversary and W = W\ > . . . > w n be the hidden list of weights. Let Mi, 1 < i < k, be the 
sequence of principal minors of M, with ground set Ei and rank r^, and let V denote a partition 
matroid as defined in [21, Section 4], with ground set E and independent sets 

AV) = \\Jli-IiQEi,\Ii\<r)^. 

For a uniformly random bijection u) : E — > W, let OPT-p(w) be the maximum weight of an 
independent set in matroid V , and ALG(w) be the set returned by Algorithm 2. Conditioning on 
the set of weights assigned to the elements of each block Ei, the elements in Ei receive a random 
permutation of this set of weights. Since each Mi is uniformly dense, By Theorem 2.1, Algorithm 
2 recovers in expectation a 1 /40-fraction of the sum of the heaviest weights assigned to elements 
in Ei. However, the union of the heaviest ri elements in each Ei is indeed the optimum solution in 
the partition matroid V . By removing the conditioning we get 

E w [w(ALG(u))] > ±E W [OPTVH] . (3) 

Moreover, Soto in [21] proved that E w [OPT-p] is only a constant factor away from the optimum of 
E w [OPT^]. 

Lemma 2.5 (Soto [21]). E w [w(OPT v )(u))] > (1 - l/e)E w [w{OPT m (oj))] . 

This proves the corollary. □ 



3 Approximation algorithms for unknown matroids 

In this section we will be focusing mainly on the AO-RA-MN model, i.e. assuming that the ordering 
of the elements of the matroid is adversarial, weights are assigned randomly, but the matroid is 
unknown, and the algorithm only knows n in advance. We present an O(lognlogr) approximation 
algorithm for the AO-RA-MN model, where n is the number of elements in the ground set and r 
is the rank of the matroid. It is worth noting that in these models the adversary may set some of 
the elements of the matroid to be loops, and the algorithm does not know the number of loops in 
advance. For example it might be the case that after observing the first 10 elements, the rest are all 
loops and thus the algorithm should select at least one of the first 10 elements with some non-zero 
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probability. This is the idea of the counterexample in section 4 (Corollary 4.4), where we reduce AO- 
RA-MN, AO-RA-MU models to RA-CN, RA-CU models respectively, and thus we show that there 
is no constant-factor approximation for either of the models. In fact, no algorithm can do better 
than f2(logn/loglogn). Therefore, our algorithms are tight within a factor of O (log r log log n) or 
0(log r log 6 n). 

At the end of this section we also give a general framework that can turn any a approximation 
algorithm for the RO-AA-MN model, (i.e. the primary variant of the matroid secretary problem) 
into an 0(a log 1+e n/e) approximation algorithm in the RO-AA-MU model (see subsection 3.2). 

We use the same notation as section 2: Ai = (E, I) is a matroid of rank r (which is not known 
to the algorithm), and ei,e2,...,e n is the the adversarial ordering of the elements of A4, and 
W = {wi > W2 > ■ ■ ■ > w n } is the set of hidden weights chosen by the adversary that are assigned 
to the elements of A4 via a random bijection uj : E — >■ W . 

3.1 Approximation for AO-RA-MN models 

We start by deriving an O(lognlogr) approximation algorithm for the AO-RA-MN model. Our 
algorithm basically tries to ignore the the loops and only focuses on the non-loop elements. We 
design our algorithm in two phases. In the first phase we design a randomized algorithm that 
works even in the AO-AA-MU model assuming that it has a good estimate on the weight of the 
largest non-loop element. In particular, fix bijection uj : W —> E, and let e\ be the largest non- 
loop element with respect to w, and e\ be the second largest one. We assume that the algorithm 
knows a bound ^(e^) < L < oo{e[) on the largest non-loop element in advance. We show there 
is a thresholding algorithm, with a non-fixed threshold, that achieves an O(logr) fraction of the 
optimum (see subsection 3.1.1). 

In order to solve the original problem, in the second phase we divide the non-loop elements into 
a set of blocks B\, B2, ■ ■ ■ , B\ ogn , and we use the previous algorithm as a module to get an 0(log r) 
of optimum within each block (see subsection 3.1.2). 

3.1.1 Approximation for AO-RA-MN model, with an estimate on the largest weight 

Let us start by the first phase. Since our algorithm works in a more general model, here we assume 
that we are in the AO-AA-MU model, i.e. assuming that both the ordering of the elements and 
assignments of the weights are chosen adversarially, and the algorithm knows nothing except a 
bound w^) < L < w(e*) on the largest non-loop element. We design a randomized O(logr) 
approximation algorithm for this model. 

Note that if r is also known in advance then a simple variant of the thresholding algorithm 
of Babaioff et al. [3, ThresholdPrice Algorithm] would be a O(logr) approximation. Indeed it is 
sufficient to select a threshold L/2 1 , for < % < logr uniformly at random, and then include all the 
elements above the threshold that are independent of the elements chosen so far. Here, since we do 
not know r, our algorithm keeps track of the rank of the elements seen so far, and tries to update 
the threshold according to it. In particular, once the rank of the elements seen so far reaches 2 l , the 
algorithm inserts a new threshold dynamically and works with it as if it exists since the beginning 
of the algorithm. The details are described in Algorithm 3: 

Let E\ be the event the algorithm chooses the option in step 1. Also let r*(t) and w*(t) be the 
value of r* and w*, respectively, after observing the first t elements of the input. In particular, r*(n) 
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Algorithm 3 Algorithm for AO-AA-MU model, when an estimate of the largest non-loop element 

is known 

Input: The bound L such that < L < oj(e\). 

Output: An independent set ALG C E. 

1: with probability 1/2, pick a non-loop element with weight above L and return it. 

2: ALG <- and r* <- 2. 

3: set threshold w* <— L/2. 

4: for each arriving element ej do 

5: if oj{ei) > w* and ALG U {e{\ is independent then 
6: ALG <- ALG U {ej 
7: end if 

8: if rank({ei, . . . , e{\) > r* then 

9: with probability log 2 r * set w* <— L/2r*. 
10: r* 4- 2r*. 
11: end if 
12: end for 
13: return ALG 



will be the rank of M, and w*(n) will be the final value of the threshold chosen by the algorithm. 
The following observation describes some properties of the algorithm: 

Observation 3.1. Assuming ~^S\, for any matroid of rank r, observe that r*(n) in the algorithm 
will be the smallest power of 2 greater than r (i.e. r*(n) < 2r). Therefore, the algorithm will choose 
between at most log (2r) different thresholds, where for each i, the threshold w* (t) will be decreased 
to L/2 1 at the first time t(i) where rank{e\, . . . , e t ^) = 2 l ~ l , with probability 1/i. 

Hence, by applying a simple induction it is not hard to see that at any time t in the execution 
of the algorithm, 

w*(t) 



1 < i < logr*(t), P 



l/logr*(t), (4) 



2 

where the probability is over all of the randomization in the algorithm. 

Theorem 3.2. For any matroid Ai = (E,I) of rank r, and any bijection uj : E — > W , given the 
bound oo{e^) < L < uo(e\), Algorithm 3 is a 16 log r approximation in the AO-AA-MU model, i.e. 

EKALG(w))] > — — u>(OPT(w)), 
lb log r 

where the expectation is over all of the randomization in the algorithm. 

Let us partition the elements of OPT(w) according to their weights, where 

l<i<log2r: P i = |eGOPT(a;) : | < w(e) < ^j. (5) 

First in the next claim, we show that conditioned on w*(n) = L/2 1 (and _, <fi), the expected 
weight of ALG(w), is a constant fraction of w(Pi), unless the size of |Pj| is very small. In the 
latter case as we will show in equation (7), we may charge w{Pi) by a 1/logr fraction of w(e*). 
Since Ex occurs with constant probability, the algorithm achieves a constant fraction of w(e*) which 
completes the proof. 
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Claim 3.3. For any 1 < i < logr, if |Pj| > 2 l , then 



E 



w(ALG(w))K(n) 



L 



A-.fi 



Proof. Let P, = {ei, . . . , e,} be the set of the first i elements. Recall that t{i) is the first time t 
where rank(Et) = 2 l ~ l . Since Pj C OPT(a;) is an independent set of Ai, we have \PiPiE t ^\ < 2 l ~ l . 
In other words, we must have seen at most 2* _1 elements of the set Pj by the time t(i). 

Suppose w*{n) = L/2 t ; since w*(t) is a non-increasing function oft (with probability 1), we get 
w*(t) > L/2*. Since Pj \ EW^ is an independent set and all its elements will come after t(i), we get 
|ALG(cj)| > |p \P t (j)| > |Pj| — 2 4_1 by the end of the algorithm. But all these elements are greater 
than w*(n) = L/2*, thus: 



E 



w(ALG(u))\w*(n) 



L 



A^£i 



PlL 



>|P\P^)I^>^TT>7^ 



where the last inequality follows from equation (5). 



□ 



Now we are ready to prove Theorem 3.2 
Proof of Theorem 3.2. Using the above claim we may simply compute the overall performance of 
the algorithm: 



E HALG(w))] 



^E [w(ALGM)|£i] + ^E KALG(w))h5i] 



> \^\) + \ E E 

i:|P;|>2 4 

w(ef) + 1_ ^ 

log 2r 



> 



> 



2 



w(ALG(w)) w*(n) 
1 







u;* (n) 


L 




— A -ifi 
2* 


P 


~ ¥ 





4 log 2r 

1 ^ WP) 



+ 



^ 81og2r 2 

-1 i: 



1 



i:|P;|>2* 



4 log 2r 



> 



uKg) 'yr ^(p) 

4 ^ 81og2r' 



(6) 
(7) 

(8) 



i=l 



where inequality (6) follows from equation (4) and Claim 3.3, inequality (7) follows from the 
assumption uj{e\) > L, and inequality (8) follows from w{Pi) < |Pi|o^T < 2L for |P;| < 2 l . 

The theorem simply follows from the fact that w(OPT(w)) < 2(uj(e\) + w {Pi))- D 
Before describing our algorithm for the AO-RA-MN model, we prove a bound on the per- 
formance of algorithm 3 when the bound L can be much larger than the maximum weight (i.e. 
oo{e\) <C L). This may happen as a special case when we want to apply Algorithm 3 as a subrou- 
tine. 

Corollary 3.4. For any matroid M = (E,I) of rank r, and any bijection oj : E — > W , given any 
bound L > w(e*,) we have 



> max ( o, W ( QPT M) _ 2L 
16 log r 



(9) 
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If in addition L < oj(e*), then 



EKALG(a;))]>^. (10) 



Proof. To prove the first inequality, note that if L < oj(e\), then we are done, otherwise suppose 
that we increase the weight of e\ to L + oj{e\). Define uJ = uj on all elements, except oj'{e\) = 
L + oj{e\) < 2L. Then by Theorem 3.2, we have 

E W ALG(u/))] > = ^° PT ,^» + L 

L v v >n ~ 161ogr 161ogr 

On the other hand, since in the worst case ALG(u;) does not have e\, while ALG(u/) has it, we 
have E [w(ALG(w))] > E [w(ALG(a/))] - 2L. Therefore 

E W ALG(u,))] > '"' 0PT '-» + Z - - 2L > max (o, J^EIMl _ 2L 
1 y K JJ1 ~ 161ogr ~ V 161ogr 

The second inequality can be proved simply by noting that the algorithm picks e\ in step 1 with 
probability 1/2. □ 



3.1.2 Approximation for AO-RA-MN by a general reduction 

Now we are ready to describe our final algorithm for AO-RA-MN model without knowing L in 
advance (here, unlike the previous algorithm we will use the random assignment of weights). The 
idea is to only consider the non-loop elements and divide them into a set of blocks B\ , B% , . . . , B\ og 2 n 
such that \Bi\ = 2 l (note that the number of non-loop elements can be quite smaller than n, but 
we do not know it in advance). After observing the first i blocks, we would have a good guess on 
the largest weight of the next block. Using that guess as a bound L, with probability 1/log (2n), 
we run Algorithm 3 on block i + 1 and return its solution as the final answer. The details are 
described in Algorithm 4. 

Algorithm 4 Algorithm for AO-RA-MN model 
Input: n, the number of elements. 
Output: An independent set ALG C E. 
1: Choose a number < b < log n uniformly at random. 

2: Observe the first 2 b — 1 non-loop elements without picking any of them, and let L(b) be the 

largest weight among these non-loop elements. 
3: Run Algorithm 3 only on the next 2 b non-loop elements (ignore loops), with parameters n = 2 b 

and L = L(b), and return its output. 



The next theorem proves the correctness of the algorithm 

Theorem 3.5. For any matroid Ai = (E,I) of rank r, Algorithm 4 is a O(logrlogn) approxima- 
tion in the AO-RA-MN model 

Let F be the set of non-loop elements, m := \F\, and let Fi C F be the set of first 2 l+l — 1 
non-loop elements (as a special case Fi ogm = F. We divide the elements of F into a set of blocks 
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Bq, B\, . . . , i?[i ogm j , where Bq := Fq, and for each i > 0, B{ := Fi \ Note that the size of the 

last block | .Biog „i| = m + 1 - 2 L lo s "^J can be much smaller than 2 L lo s m J . 

For a set of weights W C W and £" C E of elements such that |W| = let £w(E') be 
the event w(£") = W). Fix a set W C W of size |W'| = Throughout the proof we always 
condition on £\y* (F) . Define 

< i < Llog?nJ : O l = B ul [w(OPT{u)nBi)\£ w ,(F)] , (11) 

to be the expected value of the optimum set in each of the blocks. We will show that 

logm 

E w [ W (ALG(cj))\£ w ,(F)} > — — V B t . 

2500 log r log n ^— ' 

b to i=i 

In the next claim we show that conditioned on algorithm chooses b = i in the step 1, it will get an 
0(1/ log r) fraction of 0{. Note that in this claim we do not analyze the special case of b = [log m\ . 

Claim 3.6. If the algorithm chooses b = i < [logmj in step 1, it will get an 0(1/ log r) fraction 
ofOt: 

E w [«,(ALG(a;))|6 = i,S w ,(F)] > j^^O^ 

Proof. Fix a set of weights S = {si > S2 > ■ ■ ■ > s 2 i+i_i} C W. Conditioned on £g(Fi), there is 
a constant probability that si G uj(Bi) and s 2 ^ cj(-Bj); thus L(6) = s 2 will be a feasible bound for 
Algorithm 3. Therefore, we may apply Theorem 3.2 and obtain O(logr) fraction of Oi. Thus 

E w [w(ALG(uj))\£s(Fi),b = i,£ W '{F)} > 

> ^ [u;(ALG(w))|si G B i)S2 $ B u £ s {Fi),b = i,8 w .(F)\ 

> E w KOPT(w)nBi)|si G B;,s 2 £ B^i^H^F)] (12) 

54 log r 

128 log r 

Here inequality (12) follows from Theorem 3.2, and inequality (13) holds by noting that removing 
the condition s 2 ^ B{ can only double the expectation of OPT, while removing s\ ^ Bi may only 
decrease its expectation. The claim simply follows by summing up inequality (13) over all events 
£ S (F), for any S C W, \S\ = 2 i+1 - I. 

□ 

Now we are ready to Prove Theorem 3.5 
Proof of Theorem 3.5. We use Claim 3.6 to lower bound the expected gain of the algorithm from 
all except the last block. We need to analyze b = [log to] differently. Indeed if Bn ogm j <C m/2, the 
bound Lib) will be much larger than the largest weight in w(B^ ogm j) w.h.p. Therefore, we apply 
Corollary 3.4 for this special case. Intuitively, the loss incurs by misreporting the bound L([logmJ) 
is no more than the largest weight in W, and this can be compensated simply by selecting the 
largest weight with constant probability. 

Let L' be the largest weight in W' . By Corollary 3.4 (equation (9)), we obtain 

E w [w(ALG(w))|6 = [\ogm\,£ w ,(F)} > max fo, - 2L' J . 

\ lb log r J 
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Therefore, by Claim 3.6 and the above inequality we get: 

[log mj 

Eu KALGM)] = £ E u [w(ALG(u}))\b = i,S w ,(F)]P u [b = i\e w ,(F)] 

i=0 

/[logmj-l D \ 

> V * +max { 0; !li^-2L / } . (14) 

~ log2ra I ^ 128 log r 1 ' 16 log r ; I v ; 



In order to lower bound the RHS it suffices to show that E w [w(ALG(w))|<£V'] = ^(L'/logn). 
This simply follows from the second part of Corollary 3.4. For any block Bi, conditioned on 
V € u} (B{), with probability 1/2, the second largest weight in u(Fi), is not assigned to Bi, in which 
case algorithm achieves V with probability 1/2, once it chooses b = i: 

log m | _ | 

B u [w(ALG(u}))\£ W '(F)] = V] y — — E w [w(ALG(uj))\L' £ uj(Bi),£ W r(F)] 

i=0 s 

|^| E U) [w(Kl,G(uj))\b = i,L' eu(Bj),£ w ,(F)} 

^-^ log m log 2n 

i=o to to 

log m I R I T I J I 

> y \Bj\L = L 
~ ' 4 log m log 2ra 4 log 2n 

i= g fa fa to 

Therefore, by adding up equation (14) and 8 times equation (15) we obtain 

/[logmj-l „ 

SEuMALGMWw,] > -—[ T-90T ^ max {°' — ii^^i - 2L'} + 21/ 

log 2ra I 128 log r 16 log r 

[log mj 



O / 1 \ 

~ ^ 1281ogrlog2n V lo g rl °g n / 



8=0 

Summing both sides of the inequality over all events £yy completes the proof. □ 



3.2 Matroid secretary with unknown n 

In this subsection we consider the primary variant of the matroid secretary problem. When the total 
number of elements n is known in advance (RO-AA-MN model), there is an 0(log r)-approximation 
which was designed in [3] and is still the best known approximation for this problem. 

Here we show a simple reduction which implies that if we do not have any information about the 
matroid or the number of elements (the RO-AA-MU model), we can achieve an 0(^ log 1+e nlogr)- 
approximation for any fixed e > 0. 

Theorem 3.7. Let M be a matroid of rank r on n elements. If there is an a approximation 
algorithm for the matroid secretary problem on A4 in the RO-AA-MN model, then for any fixed 
e > 0, there is also an 0(j log 1+e n)- approximation for the matroid secretary problem on M. with 
no information given in advance (the RO-AA-MU model). 
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Proof. We guess a number n' according to a probability distribution with a polynomial tail, as 
follows: let n' = 2 l where i > is chosen with probability 



1 



Pi 



1 + e + i 



il+e" 



This distribution is chosen so that — ^ (with the remaining probability, we do nothing) 

this can be verified as follows: 



OO j OO -. „ 

g(TT^ = 1 + U(iT^- 1 + y 



1 + 



1 

1 + -. 

e 



Then we run the a-approximation algorithm as a black box, under the assumption that the number 
of elements is n'. 

Assume that the actual number of elements is n £ [2*,2 l+1 ). With probability pi, our guess of 
the number of elements is n' = 2\ If this happens, we retrieve 1/a of the expected value of the 
optimal solution on the first n' elements. Since the elements arrive in a random order, the expected 
optimum on the first n' elements is at least 1/2 of the actual optimum. Hence, in expectation we 
obtain at least 



OPT e 
Pi— — > 



1 



2a 



OPT e 

> 



1 



1 + e (l + i) 1+e 2a ~ 4a (1 + log n) l+e 



OPT. 



□ 



Therefore, if we run the O(logr) approximation of Babaioff et al. [3] as a black box we achieve 
an 0(±log 1+e nlogr) for the RO-AA-MN model: 

Corollary 3.8. For any fixed e > 0, there is an 0(- log 1+e n log r)- approximation for the matroid 
secretary problem for a matroid Ai of rank r on n elements, with no information given in advance 
(the RO-AA-MU model). In particular, assuming that M is a partition matroid matroid of rank 1, 
we obtain an 0(log 1+e n/e) approximation for the classical secretary problem, with no information 
given in advance(the CU model). 

We shall see in Section 4.2 that even in the case of r = 1 (expectation-maximizing classical 
secretary problem) where n is chosen adversarially from {1, . . . , N}, we cannot achieve a factor 
better than 0(log N/ log log N). 



4 Classical secretary with unknown n 

In this section, we consider a variant of the classical secretary problem where we want to select 
exactly one element (i.e. in matroid language, we consider a uniform matroid of rank 1). However, 
here we assume that the total number of elements n (which is crucial in the classical 1/e-competitive 
algorithm) is not known in advance - it is chosen by an adversary who can effectively terminate 
the input at any point. We consider the worst case, i.e. we want to achieve a certain probability of 
success regardless of when the input is terminated. We show that there is no algorithm achieving 
a constant probability of success in this case. However, we can achieve logarithmic guarantees and 
also prove closely matching lower bounds (see subsection 4.1). 
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In subsection 4.2 we show that even if we want to maximize the expected weight of the selected 
element, and n is known to be upper bounded by N, still no algorithm can achieve a better than 
12 (log N/ log log N) approximation factor in expectation. Consequently, we obtain that no algorithm 
can achieve an approximation factor better than Q (log N/ log log N) in the AO-RA-MN model. 

4.1 Known upper bound on n 

First, let us consider the following scenario: an upper bound N is given such that the actual number 
of elements on the input is guaranteed to be n 6 {1,2, . . . , TV}. The adversary can choose any n 
in this range and we do not learn n until we process the n-th element, (e.g., we are interviewing 
candidates for a position and we know that the total number of candidates is certainly not going 
to be more than 1000. But, we might run out of candidates at any point.) The goal is to select the 
highest-ranking element with a certain probability. Assuming the comparison model (i.e., where 
only the relative ranks of elements are known to the algorithm) , we show that there is no algorithm 
achieving a constant probability of success in this case. 

Theorem 4.1. Given that the number of elements is chosen by an adversary in {1, . . . , N} and N 
is given in advance, there is a randomized algorithm which selects the best element out of the first 
n with probability at least 1/(Hn~i + 1). 

On the other hand, there is no algorithm in this setting which returns the best element with 
probability more than 1/Hjq. Here, Hn = Y^i=i l * s ^ e N-th harmonic number. 

Our proof is based on the method of Buchbinder et al. [6] which bounds the optimal achievable 
probability by a linear program. In fact the optimum of the linear program is exactly the optimal 
probability that can be achieved. 

Lemma 4.2. Given the classical secretary problem where the number of elements is chosen by an 
adversary from {1, 2, . . . , N} and N is known in advance, the best possible probability with which 
an algorithm can find the optimal element is given by 

max 
V?i < N; 
Mi < N; 
Mi < N; 

The only difference between this LP and the one in [6] is that we have multiple constraints (16) 
instead of what is the objective function in [6]. We use essentially the same proof to argue that this 
LP captures exactly the optimal probability of success a that an algorithm can achieve. We give 
the proof for completeness; understanding the validity of this LP will be also useful for us later. 
Proof. Consider any (randomized) algorithm which finds the best element with probability at least 
a, for every possible number of incoming elements n € {1, . . . , N}. It is convenient to assume that 
the algorithm never learns n and possibly continues running beyond the first n elements (in which 
case it has failed). Let us define 

Pi = P [algorithm skips the first i — 1 candidates and chooses candidate i] . 

The probability here is over both the randomness on the input and the randomness of the algorithm 
itself. Recall that the actual number of candidates n is not known beforehand. All that the 



a, 



1 v~^ra ■ \ 
n Ei=l l Pi ^ 

Pi > 0. 



(16) 
(17) 
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algorithm knows at time i are the relative ranks of the first i candidates, which are also independent 
of n. So the probabilities pi cannot depend on n. 

Note that these are probabilities of disjoint events, so we have Y17=iPi — 1- The LP actually 
contains stronger inequalities (17). The reason why these inequalities are valid is as follows: We 
can assume w.l.o.g. that the algorithm never selects an element which is not the best so far. (Any 
algorithm can be converted to this form and perform at least as well.) The probability (over random 
permutations of the input) that the i-th candidate is the best so far is 1/i. Therefore, 

P [algorithm skips the first i — 1 and chooses i | candidate i is the best out of the first i] = 
P [algorithm skips the first i — 1 and chooses candidate i] 

— . i = xpi 

P [candidate i is the best out of the first i] 

On the other hand, the probability that the algorithm skips the first i — 1 elements is 1 — Y^j=\Pj- 
This event is independent of whether the i-th element is the best among the first i, because all the 
algorithm learns about the first i — 1 elements are their relative ranks. This proves the constraint 
(17): 

i-l 

1 — 2^Pj = P [algorithm skips the first i — 1 | candidate i is the best among the first i] < ipi. 
i=i 

The probability that the i-th candidate is the actual best candidate among the first n is 1/n. 
Conditioned on this event, candidate i is also the best among the first i candidates (and that is the 
only information available to the algorithm at that moment), so the algorithm selects candidate i 
with conditional probability exactly ipi. The total probability that the algorithm selects the best 
candidate out of the first n elements is 

n n 1 

P [success] = P [element i is optimal & algorithm selects i] = — ■ ipi. 

i=i i=i n 

We assume that the algorithm achieves success probability a for any number of candidates n 6 
{1, . . . , N} chosen by an adversary. This proves the constraint (16). 

Conversely, given a feasible solution to this LP, an algorithm can proceed as follows (see 
[6]): If it comes to the i-th element and this is the best element so far, take it with probability 
ipi/(l — ^2 % j=iPj) (which is at most 1 by (17). It can be verified by induction that the probability 
of skipping the first i — 1 elements and finding that element i is the best so far is (1 — X^ = iPj)/i, 
and hence the total probability of taking element i is exactly pi. Conditioned on element i being 
the actual optimum (which happens with probability 1/n), we take it with probability ipi. By (16), 
the success probability is at least a for any input length n. □ 

For a given N, an algorithm can explicitly solve the LP given by Lemma 4.2 and thus achieve 
the optimal probability. Theorem 4.1 can be proved by estimating the value of this LP. 
Proof of Theorem 1^.1. First, we show a feasible solution with a = H We define pi = 

1 — for each i = 1, ... ,N. This induces an algorithm as described above: if it comes to the 
i-the element and it is the best so far, we take it with probability 

ipi 1 
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By Lemma 4.2, it is sufficient to verify that (pi,a) is a feasible solution: 



1 ^ 1 

— y Wi = = ol 

n^/ 1 H N -! + 1 



implies (16), and 



~[ Hn-1 + 1 ~[ J tiN-l + 1 

implies (17). This proves that there is an algorithm with probability of success 1/(Hn-i + 1)- 

Conversely, we prove that for any feasible solution, we have a < 1/iJjv- F° r this, we in fact 
consider a weaker LP: 



max a 



Vn<N; lYTi=iiPi><x, (18) 



EiiR<l, (19) 
V« < iV; k > 0. 

Obviously, any feasible solution to (16-17) is also feasible for (18-19). Fixing a, consider a 
feasible solution to (18-19) which minimizes Y2i=iPi- We claim that ipi > a for each i. If not, take 
the first index j such that jpj < a. By (18) for n = j, there must be a smaller index j' < j such 
that j'pji > a. Then we can decrease py by 5/j' and increase pj by 5/j for some small 5 > 0, so 
that j'pf + is preserved. We can make sure that no inequality (18) is violated, because the 
left-hand side is preserved for all n > j, and the inequality was not tight for j' < n < j . On the 
other hand, J2iLiPi decreases by 5/j' — 5/j. This is a contradiction. 

Therefore, we have pi > a/i for all i. By summing up over all i and using Ym=iPi ^ 1> we S e t 



N N 

1 > ^Pi >"^t = aH N . 



i 

i=i i=i 



□ 



4.2 Maximizing the expected weight 

A slightly different model arises when elements arrive with (random) weights and we want to 
maximize the expected weight of the selected element. This model is somewhat easier for an 
algorithm; any algorithm that selects the best element with probability at least a certainly achieves 
an a-approximation in this model, but not the other way around. Given an upper bound iV on the 
number of elements (and under a more stringent assumption that weights are chosen i.i.d. from 
a known distribution), by a careful choice of a probability distribution for the weights, we prove 
that still no algorithm can achieve an approximation factor better than an ri(log N/ log log N)- 
approximation. 

Theorem 4.3. For the classical secretary problem with random nonnegative weights drawn i.i.d. from 
a known distribution and the number of candidates chosen adversarially in the range {1, . . . ,N}, 
no algorithm achieves a better than 32 ^ ^ N -approximation in expectation. 
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The hard examples are constructed based on a particular exponentially distributed probability 
distribution. Similar constructions have been used in related contexts [16, 10]. Proof. We define a 
probability distribution over weights as follows. For a parameter 7 G (0, 3) (possibly depending on 
N), let the weight of each element be (independently) 



2 7J with probability 1/2- 7 , for each j > 1. 



Note that although the weights are unbounded, the expected weight of each element is finite. 

Consider blocks of elements where the i-ih. block Bj has size 2*. The adversary will choose 
arbitrarily a number of blocks £ < log N, and a stopping point n = £j = j 2* = 2^ +1 — 1. Note that 
given £, the expected optimum is 



OPT e > w j p 



Wj is the largest weight among 2 elements 



The probability that no weight larger than Wj appears among 2 l elements is (1 — 1/2^ ) 2 . So 



wj is the largest weight among 2 elements 



(1 - 1/2*)* - (1 - 1/2* 
9(min{2^,l}). 



Therefore, since Wj = 2 7J and 7 G (0, 3), the expected contribution from elements of weight Wj 
is roughly 2 7: ' min{2 *, 1}, which is maximized for j = I. (Note also that the distribution decays 
exponentially both for j > I and j < £.) So the largest contribution comes from elements of weight 
roughly W£. We can estimate: 



OPT e > w £ P 



W£ appears among 2 elements 



2^(1 - (1 - l/2 £ ) 2 ') > (1 - l/e)2^. 



Now consider any algorithm (which does not know I beforehand). Let pi denote the probability 
that the algorithm skips the first i — 1 blocks and then chooses some element in block Bj . Note that 
this event might be correlated with the random weights that appear in blocks B\, . . . , Bj. However, 
we have a bound on the probability that weight Wj appears in block Bj\ 

P [wj appears in block Bj] = 1 - (1 - l/2 j ) 21 < min{2 i -' J ', 1}. 

Let ptj denote the probability that the algorithm gets an element of weight Wj from block Bj . By the 
above we have pij < mm{2 l ~ 3 , 1}. Also, by definition of the probabilities, Yl'jLi Pij = Pi- Given py, 
the expected weight that the algorithm obtains from block Bj is E [profit from Bj] = Yl'jLi w jPij 

and the total profit over the first £ blocks is Yli=i X^Li w jPij- Thus the expected profit of any 
algorithm can be bounded by the following LP. 



max 



a 



< logiV 
Vi, j 

V7 



ELi ££=1 w iPij ^ otOPTf, 
Vij <min{2 i -J',l}; 

Eoo 
j=l Pij — Pi'i 

ELi Pi < i; 
Pi > 0. 



19 



We estimate the value of this LP as follows. Subject to the condition X^=i Pij = Pit ^ ne quantity 
YlJLi w jPij wm b e maximized if we make p^ for large j as large as possible. However, note that 
^ J Pij < 2 7? 2 l ~ J , so the tail for j — > oo decays exponentially and we might as well concentrate only 
on the first term. Assuming that pi = 2 l ~ k , the best choice is to set p^j = for j < k and p^j = 2 l ~i 
for all j > A; + 1, which gives 

oo oo _. 

^w jPij < 2^2*-' < _ 2 (7-i)(fc+i)+i < 2 . ( 2 *-fc)i-7 2 7i 

i=i j=fc+i 

where we used 7 6 (0, |). Note that for any value of pi, we can apply this argument to the power 
of 2 nearest to pi\ hence, 

00 

3=1 

Now suppose the adversary stops the game after I blocks. The expected optimum is OPTg > 
(1 — l/e)2 7 ^ (see above), while the algorithm gets 

l 00 £ 



This should be at least aOPTi > a(l — l/e)2 7 ^; therefore, we get 

^;-¥«)>i(l-l/e)a>i ( 



-a. 

- 4 . 

i=i 

We sum up these inequalities for I = 1, . . . , log N: 

logiV £ logN log TV 



E E^~ 727( ^ = E ^ 1_7 E 27(l ^ } * 



£=1 i=l i=l l=i 

The sum Y^i=i 27(i " £) is bounded by 27(i ~ £) = < ~ ■ Therefore, we get 

ID ^ i_ 7 

01 ~ Y^gN E Pi • 

Given that ^fci^K = 1 an d ^ ne function rr 1 " 7 is concave, the best value of a can be achieved if 
we set pi = 1/log N for all i. Then, we have 

16 

a < — ; r-; . 

~ 7(logiV) 1 ~ 7 
Finally, we set 7=1/ log log N which gives 

32 log log N 



a < 



logiV 

□ 



Consequently, we obtain that no algorithm can achieve an approximation factor better than 
fi(log iV/loglogiV) in the AO-RA-MN model. 
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Corollary 4.4. For the matroid secretary problem in the AO-RA-MN (and AO-RA-MU, RO-AA- 
MU) models, no algorithm can achieve a better than £1 ( \^°f^ N ) - approximation in expectation. 

Proof. It is not hard to convert the example of Theorem 4.3 into a hard example for the AO-RA- 
MN model. It suffices to let M to be a partition matroid of rank 1, and let the first n elements 
of the inputs to be non-loop while the rest of the input contains only loops. Since the algorithm 
does not know n in advance (it only knows n < N), it essentially has to choose one of the first 
n elements without knowing n, which is a secretary problem where the number of candidates is 
chosen adversarially in the range {1, ... ,N}. Therefore, no algorithm can achieve an approxima- 
tion factor better than fi(log N/ log log N) (the same is also true for the AO-RA-MU, RO-AA-MU 
model, where nothing is known about n in advance). □ 



5 Conclusion and open questions 

We presented a number of positive and negative results for variants of the matroid secretary prob- 
lem. The main open question is if there is a constant-factor approximation in the RO-AA-MN 
model, where weights are assigned to elements adversarially and the input ordering of elements 
is random. An easier question might be whether this is possible in the RO-RA-MN model where 
both the input order and weight assignment are random, but only the total number of elements 
n is known in advance (as opposed to the full matroid structure, as in [21]). Note that under an 
adversarial assignment of weights, knowing the matroid beforehand (RO-AA-MK) does not seem 
to be easier than the RO-AA-MN model; the true input could be embedded in a much larger ma- 
troid with most weights set to zero. A similar question arises for the AO-RA-MN model: whether 
it is possible to improve the O(lognlogr) factor, thus closing the gap with the lower-bound of 
SI (log n/ log log n). 
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